Web Scraping for Online Catalog

upwork.com 🟡 2026-05-02

🔹 Web Scraping for Online Catalog
👤 Client: 🇨🇿 Czech Republic Member since 2014-04-29
💰 Price: $30
🚩 Problem: Scraping a large online catalog with Cloudflare protection within 24 hours.
📦 Existing: [URL]

Specifications:

[Target] - Extract data from 70,000 subpages of an online catalog.
[Method] - Use headless browsers and proxy rotation to bypass Cloudflare.
[UI/UX] - Not applicable.
[Stack] - Python with Scrapy or BeautifulSoup, Selenium for dynamic content, Proxy services like Scrapingant or ScraperDo.
[Security] - Ensure data is handled securely; use encrypted connections and secure storage.
[Format] - Output CSV tables.

Workflow:

1. Set up a headless browser environment with Selenium to handle dynamic content.
2. Implement proxy rotation using services like Scrapingant or ScraperDo to bypass Cloudflare.
3. Write scraping scripts to navigate through pagination and extract URLs for entry details.
4. Develop scripts to scrape data from the 68,000 detail pages.
5. Validate and clean scraped data before exporting to CSV.

⚡ Receive notifications instantly Join our community.

Discord Telegram

Our Social Networks

LinkedIn Twitter Facebook

🕷️️ Job Radar • SCRAPING